LPC-based inversion of the DRM articulatory model

نویسنده

  • Sacha Krstulovic
چکیده

Articulatory representations are expected to bring better speech recognition results. This requires to estimate the parameters of a speech production model from the speech sound, problem known as acoustico-articulatory inversion. Known methods to solve this problem usually introduce a heavy computational cost. Alternately, it is known that Linear Prediction analysis offers an analogy with acoustic filtering. This analogy had been exploited to develop a less expensive analytic method applicable to the estimation of tube shapes discretized in equallength sections. We have extended the method to the DRM case, where the tube is made of unequal-length sections. The proposed DRM inversion scheme is thus simpler and faster. Furthermore, it shows good performance in terms of low residual modeling error. It also enhances speech recognition results when used to compute Log Area Ratios.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lpc Modeling with Speech Production Constraints

Despite the approximations it supposes, performing LPC-based acoustico-articulatory inversion is justified in some applicative frameworks. By illustrating this assertion through experiments aiming at incorporating speech production constraints from the DRM model and from a factor-based model into an LPC modeling scheme, we promote the use of LPC-based inversion as an interface between Productio...

متن کامل

Acoustic-to-articulatory inversion using a speaker-normalized HMM-based speech production model

Acoustic-to-articulatory inverse mapping is a difficult problem because of its non-linear and oneto-many characteristics. We have previously developed a speech inversion method using a hidden Markov model (HMM)-based speech production model which takes into account the phonemespecific dynamic constraints of articulatory parameters. We found that the constraint significantly decreases the estima...

متن کامل

Relating LPC modeling to a factor-based articulatory model

This paper proposes a method for recovering the articulatory parameters of a factor-based vocal tract shape model from the speech waveform. This is realized by analytically relating the shape model to a Linear Prediction lattice lter. Results pertaining to human vowels are presented. They show a good agreement with phonetic characteristics in a real-time computational framework.

متن کامل

Unsupervised vocal-tract length estimation through model-based acoustic-to-articulatory inversion

Knowledge of vocal-tract (VT) length is a logical prerequisite for acoustic-to-articulatory inversion. Prior work has treated VT length estimation (VTLE) and inversion largely as separate problems. We describe a new algorithm for VTLE based on acoustic-to-articulatory inversion. Our inversion process uses the Maeda model (MM, [1,2]) and combines global search [3] and dynamic programming for tra...

متن کامل

Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: benefit to speech recognition

Speech recognition using articulatory features estimated using Acoustic-to-Articulatory Inversion (AAI) is considered. A recently proposed sparse smoothing approach is used to postprocess the estimates from Gaussian Mixture Model (GMM) based AAI using MinimumMean Squared Error (MMSE) criterion. It is well known that low-pass smoothing as post-processing improves the AAI performance. Sparse smoo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999